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Abstract 

Escalation with overdose control (EWOC) is a Bayesian adaptive phase I clinical trial design that produces consistent 
sequences of doses while controlling the probability that patients are overdosed. However, this design does not take 
explicitly into account the time it takes for a patient to exhibit dose limiting toxicity (DLT) since the occurrence of DLT is 
ascertained within a predetermined window of time. Models to estimate the Maximum Tolerated Dose (MTD) that use the 
exact time when the DLT occurs are expected to be more precise than those where the variable of interest is categorized as 
presence or absence of DLT, given that information is lost in the process of categorization of the variable. We develop a 
class of parametric models for time to toxicity data in order to estimate the MTD efficiently, and present extensive 
simulations showing that the method has good design operating characteristics relative to the original EWOC and a version 
of time to event EWOC (TITE-EWOC) which allocates weights to account for the time it takes for a patient to exhibit DLT. The 
methodology is exemplified by a cancer phase I clinical trial we designed in order to estimate the MTD of Veliparib (ABT- 
888) in combination with fixed doses of gemcitabine and intensity modulated radiation therapy in patients with locally 
advanced, un-resectable pancreatic cancer. 
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Introduction 

Cancer phase I clinical trials constitute the first step in 
investigating the safety of potentially promising new cytotoxic or 
biological drugs in humans. In these studies, patients are accrued 
to the trial sequentially and the dose allocated to the next patient 
depends on the doses and dose limiting toxicity (DLT) status of all 
previously treated patients. The goal is to estimate a dose level that 
is associated with a pre-determined level of DLT. 

Dose assignment is carried out after the DLT status of patients 
under observation is resolved. This occurs within one cycle of 
therapy, which typically lasts 3 to 6 weeks. The target dose y is 
referred to as the maximum tolerated dose (MTD), and is defined 
as the dose that is expected to produce DLT in a specified 
proportion 6 of patients: 

P(DLT\Dose = y) = 6. (1.1) 

Several statistical methodologies have been proposed in the 
literature to select the MTD, see [1,2,3,4] for a review. An 
important class of methods that produce consistent sequences of 
doses are Bayesian adaptive designs, such as the continual 
reassessment method (CRM) proposed by O'Quigley et al. [5] 
and its modifications [6,7,8,9,10], and the escalation with overdose 
control (EWOC) method described by Babb et al. [1 1], Zacks et al. 



[12], Tighiouart et al. [13,14,15,16], and Tighiouart and Rogatko 
[17]. ' 

A limitation with this design is that the toxicity outcome is coded 
as a binary variable, presence or absence of DLT. For trials where 
the length of a cycle of therapy is two months or longer as in 
treatments involving radiotherapy, total trial duration can be very 
long making the trial practically non-feasible. This limitation 
motivated the development of models to estimate the MTD that 
take into account the amount of time patients are under 
observation when new patients are about to enter the trial. 
Cheung and Chappell [18] extended the CRM to allow late-onset 
toxicity. Their approach was to allocate weights to account for the 
time it takes for a patient to exhibit DLT. A similar approach was 
adapted to EWOC by Mauguen et al. [19]. They showed that the 
design operating characteristics of EWOC in terms of safety and 
MTD recommendation were maintained while the length of the 
trial was reduced when compared with EWOC. These approaches 
assume that the weights are linear function of the time to follow up 
with value equal to 1 if a patient experiences DLT. This implies 
that patients who experience DLT at different time points will 
contribute the same information to the likelihood function. 

In this paper, we develop a class of adaptive Bayesian models 
that take into account not only the status of DLT during the 
observation window, but also the time it takes for the patient to 
exhibit DLT. These designs are expected to be more efficient 
when estimating the MTD since more information is being 
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collected and used in the trial. Design operating characteristics are 
studied using extensive simulations and are compared to two 
versions of EWOC and time to event EWOC (TITE-EWOC) 
described in [19]. 

Methods 

In this section, we describe our design by assuming that the risk 
of DLT given dose follows a proportional hazards model [20]. The 
design is termed EWOC-PH. 

2.1 EWOC-PH 

Let 7~i, T 2 ,..., T n be nonnegative absolutely continuous random 
variables representing time to DLT. Suppose that each patient is 
observed up to time % after he or she is given the treatment. In 
practice, t is usually equal to one cycle of therapy, equivalent to 3 
or 4 weeks since drug administration or even longer for therapies 
involving radiation. Let D n — {{T h x h 3-), i=l,...,n} be the 
observed data, where T { — min(7J, t), is the dose allocated to 
patient i, and 5i = 7[T;<t). In other words, if Si = 1, we observe a 
DLT within the observation window, otherwise, the time to DLT 
is censored at time t. Note that here, we are assuming that the 
DLT status of all patients can be resolved by the end of the cycle of 
therapy. The methodology is equally applicable to the case where 
censoring occurs before time t. For instance, the time to DLT is 
censored before time t if a patient is withdrawn from the study due 
to disease progression or if a patient exhibits a severe adverse event 
not attributed to the treatment. Following the classical definition of 
the MTD where the DLT outcome variable is binary given in 
(1.1), we define the MTD y as the dose at which a proportion 6 of 
patients exhibit DLT during the observation window [0, t], i.e. 



Pv(T<T\x = y) = 6. 



(2.1) 



The value chosen for the target probability 6 depends on the 
nature and clinical manageability of the DLT; it is set relatively 
high when the DLT is a transient, correctable or non-fatal 
condition, and low when it is lethal or life threatening. Suppose 
that dose levels in the trial are selected in the interval [X min , X max ] . 

2.1.1. Likelihood. We model the risk of DLT given dose 
h(t | x) by assuming that patients given different doses of an agent 
have proportional risks of DLT. Following Cox proportional 
hazards model [20], we have 



h(t\x) = h 0 (t; ii) exp - X min )), 



(2.2) 



where ho(t; jS) is the baseline hazard function corresponding to the 
risk of DLT for a patient given dose X min and ju is a vector of 
parameters associated with the parametric baseline hazard. The 
regression effect /? represents the fixed dose effect on the risk of 
DLT. We assume that /?>0 so that the hazard of DLT is an 
increasing function of dose. After enrolling n patients in the trial, 
the likelihood function for the parameters is 



L(fi,ti\D n )= n h(Yi\x,yi exp 

i=\ 



(2.3) 



We reparameterize model (2.2) in terms of y and p 0 , the 
probability that a DLT manifests within the first cycle of therapy 
for a patient given dose x = X min . This reparameterization is 
convenient to clinicians since y is the parameter of interest and 



prior information on p 0 may be available from other trials or from 
trials using similar agents. Since there is a one to one 
correspondence between the survival function and hazard function 



P(T>t\x) 



= S(t\x) = e~ j 
Jo 



h(s\x)ds, 



(2.4) 



it follows that 



P" min + log 



log (1-0)' 



(2.5) 



Po = l~ exp{-^o(T;/i)}, 



where Ho(t;/n)= ^ho(u;/n)du is the cumulative baseline hazard 
function. 

Assuming that the baseline instantaneous risk of DLT follows an 
exponential distribution with hazard function ho(ir 9 f£) — //, one can 
show that 



/*=--log(l-p 0 ) 

T 



1 



-log 



log (1-6) 



(2.6) 



y-X min " 0 |_log(l-p 0 ). 
and the likelihood (2.3) becomes 



n \ji exp (fi(xi - X min ))] z ' exp{ - \i Y x exp (Jifa - X min ))}. 

i= 1 



(2.7) 



Other flexible parametric baseline hazards such as Weibull and 
lognormal distributions can be used at the expense of increased 
number of parameters. Using (2.6) and (2.7), one can easily write 
the likelihood of the reparameterized model L(p 0 ,y\D^. From 
(2.6), the assumption /?>0 implies that 0<p 0 <6. 

2.1.2. Prior and Posterior Distributions. Let g(po,y) be a 
prior distribution on p 0 an d y on [0, 6] x [X min , X max ] . Using Bayes 
rule, the posterior distribution of the model parameters is 
proportional to the product of the likelihood and prior distribution 



n(poM D n)KL(p 0 ,y\D n )xg(p 0 ,y). 



(2.8) 



We designed an MCMC sampler based on the Metropolis- 
Hastings algorithm [21,22] to obtain model operating character- 
istics. We also used WinBUGS [23] to estimate features of the 
posterior distribution of the MTD and design a trial. In the 
absence of prior information about the MTD and probability of 
DLT at X min , independent vague priors are selected for p 0 and y. 

2.1.3 Trial Design. Dose levels in the trial are selected in the 
interval [X min , X max ]. The adaptive design proceeds as follows. 
The first patient receives the dose x\ — X min . If this patient 
experiences DLT within the observation window(0,i], then we 
would recommend stopping the trial. Otherwise, the marginal 
posterior cdf of the MTD given that the first patient did not exhibit 
DLT by the end of the cycle of therapy is denoted by IIi(y) = II (y 
| (t, xi, 0)). The second patient receives the dose X2=II 1 ~ 1 (a) so 
that the posterior probability of exceeding the MTD is equal to the 
feasibility bound a. This is the overdose protection property of 
EWOC, where at each stage of the design, we seek a dose to 
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allocate to the next patient while controlling the posterior 
probability of exposing patients to toxic dose levels. Suppose the 
k-th patient is ready to enter the trial at time t k . We then calculate 
Hfc-i(y) = II(y | (T h X{, i- l,...,k-\) up to time t k . Note that 
here, T { is either equal to t if patient i already finished one cycle of 
therapy with no evidence of DLT by the time patient k is ready to 
enter the trial, or Yi is the time since patient i was given dose x { 
until time t k if this patient is still at risk by this time. Otherwise, T { is 
the time to DLT for that patient. The &-th patient receives the 
dosex^ = H k ~]_ l ((x). The trial proceeds until a pre-determined 
number of patients are enrolled to the trial. At the end of the trial, 
we estimate the MTD as the median of the posterior distribution 
of y. We note that here, information on the DLT status of a patient 
past the observation window (0,t] is not used in the model due to 
the definition of the MTD in (2.1). Such data may be appropriate 
to use if one is interested in optimizing both dose and number of 
cycles as in Braun et al. [24]. 

2.2 TITE-EWOC 

Cheung and Ghappell [18] extended the CRM to allow patients 
to enter the trial continuously, and called the design time to event 
CRM, TITE-CRM. Their approach was to allocate weights to 
take into account the time it takes for a patient to exhibit DLT. 
The method was further studied by Polley [25] to accommodate 
situations where we have fast patient accrual. TITE-CRM was 
adapted to EWOC by Mauguen et al. [19]. They assumed that the 
probability of DLT is given by 



Pr(a=l|x)=F(A> + /»ix) = 



exp (A; + /?!*) 
l + expO?o + 0ix)' 



(2.9) 



where /?i>0. Letting w i =Y i /T if (5 ? = 0 and w { — 1 if (5 z -= 1, the 
corresponding likelihood function is 



L(Mx I A,) = n + fax,)]*' [1 - w,F(fi 0 + ftx,)] 1 "^. 10) 

1=1 

The model is further reparameterized in terms of po and y as in 
Section 2.1.1. These parameters are given by: 



y= ■ 



l og(fl)-log(l-fl)-jgo 

Pi 

exp(ff 0 + ftX min ) 

l+exp(j8o + ft*min)" 



(2.11) 



We note that this approach implies that patients who experience 
DLT at different time points will contribute the same information to 
the likelihood function. Trial design proceeds as described in 
Section 2.1.3. 



the longer the time t k , the higher the recommended dose for 
patient k is. 

(iii) Suppose the DLT status of the first k-2 patients has been 
resolved. If patient k-l exhibits DLT shortly after he or she is 
given dose x k _i, then the dose recommended for the next 
patient is much lower than the dose given to patient k had 
patient k-l exhibited DLT later in the cycle. 

Property (i) is the overdose protection defining characteristic of 
EWOC which is also satisfied by TITE-EWOC but not by TITE- 
CRM. Property (ii) is intuitively appealing and although not 
mentioned in [18] and [19], it is shared by both TITE-CRM and 
TITE-EWOC. In fact, the property holds because the weight 
function w in (2.10) is an increasing function of Tas is shown in the 
proof of Theorem 1 below. Property (iii) is also naturally appealing 
because the amount of dose level reduction is a decreasing 
function of the time it takes for a patient to exhibit DLT. Property 
(iii) does not hold for TITE-CRM and TITE-EWOC since 
patients who exhibit DLT at different time points contribute the 
same weight in the likelihood function (2.10). Characteristics (ii) 
and (iii) are summarized in the following theorem. 

THEOREM 1. LetD k = {(T h x h 3j),...,(T k , x h 3/J} be the data on 
the first k patients generated by the design described in Section 2.1.3 and 
II k (y;T k ) be the cdfi of y given the data D k . Let x k +\ =TL k ~ l (oL; Y k ) and 
x' k +\ = H k ~ l (oc; Y'k). Suppose that for all i = l,...,k-l, either d, = 1 or 
(T i} = (t, 0). Then, x' k +\ >x k +\ whenever Y k > Y k . Furthermore, if 
the data D k is generated by TITE-EWOC in Section 2.2 or TITE-CRM, 
and if b k — 0, then x'k+i >x k +\ whenever Y k > Y k . 

Proof. 

Let 



L(p 0 ,y\D k ) = 

log(l-po) 



k 

n 



•exp 



log(log(l-0)/log(l-po)) 



{Xi ^min) 



x exp< r,- l0g(1 -^-ex P ( l0g ( 1 ° g(1 - 9) y / l0g(1 -^-(x,.-X min ) 



be the likelihood (2.7) reparameterized in terms of p 0 an d y. To 
simplify notation and presentation of the proof, we assume that 
Z min = 0, Z min = 1, t = 1, and p 0 is fixed. Let L k (y) = L k (p 0 ,y\D k ), 
7t(y) be a proper prior density for y, and h(y) = log (1 — p 0 ) 
exp{(x^/y)log(log(l — 0)/log(l — p 0 ))}- W e note that since 
0<Pq<Q, the function h( 9 ) is negative and monotonically 
increasing in y. Using Bayes' rule, the posterior c.d.f H k (t;T k ) of 
the MTD y is 

Uk(t )= £ L k - 1 (y)( - h(y)) 3 k exp ( Y k h(y))n(y)dy 
^L k -\(y)(-h(y)fk Qxp (Y k h(y))n(y)dy 



2.3 Characteristics of EWOC-PH 

The proposed design EWOC-PH assigns dose levels to future 
patients by taking into account the most recent DLT status of 
currently and previously treated patients according to the 
following properties. 

(i) At each stage of the design, we seek a dose to allocate to the 
next patient while controlling the posterior probability of 
exposing patients to toxic dose levels. 

(ii) Suppose the DLT status of the first k-2 patients has been 
resolved. If patient k- 1 does no exhibit DLT by the time 
patient k is ready to be enrolled to the trial at time t k , then 



It follows that 



U k (t; Y k )-U k (t; Y' k ) = 

£ Lk-i (y)( - h(y)fk exp ( Y k h(y))n(y)dy 



Jo^-i(y)( 

Jo'4-i(y)(- 



-h(y)fkexv{Y k h{y))n(y)dy 
-h(y)pexp(Y' k h(y))n(y)dy 



So 4-i(y)(-My))^ exp(Y' k h(y)My)dy 



Since we are assuming that the DLT status of the first k- 1 patients 
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has been resolved, i.e., either 5 t ■ = 1 or [T h S-) = (t, 0) for i<k-l, 
then L k — \{y) = L k — \\y). Hence, 

U k (t;Y k )-U k (t; Y' k ) 



= A~ 



^ j 1 L^^L^iYX-hiy^H- Ky')) 5 k 



exp ( Y k h(y)) exp ( Y' k h(y'))n(y)n(y')dy' 'dy 



-A 



Jo It 



L k _ x (y)L k _ x (y\-h(y))H-hW)f k 



exp ( Y' k h(y)) exp ( Y k h(y'))n(y)n{y')dy 'dy 



where 



Jo Jo 

exp ( Y'k h{y'))n{y)n{y')dy'dy. 

Since Y k <Y' k ,y<y' and /z(.) is increasing in y, then 
e Yk h(y)e Y k h(y') — e Yk h(y)e Yk h(y')>0. Furthermore, since h(-) is 
negative, My)' My') is nonnegative. Hence, n^(7; Y k )>H k (t; Y' k ), 
which implies that n^ ! (a; 5^) <n^^a; F'^), that is x k+ \< 
x' k+ \. This completes the proof of the first part of Theorem 1. 

If S k =0, then the likelihood (2.10) for TITE-EWOC is L k (y) = 
L k .j(y)(\-w k (T k ) Ey;x k )), where w k (T k ) is the weight function defined 
in Section 2.2 and F(y;x k ) is the logistic function in (2.9) 
reparameterized in terms of the MTD y. Using similar calculations 
as above, we have 

U k (t-Y k )-U k (t- T k ) = A- 1 

[I l! Lk ~ 1 {y)Lk ~ 1 (yMy)7l(Y)[(Wk( Yk) - Wk( Y ' k ^ F ^ Xk ^ - F <r> x ^Wdy 

where 



L k - 1 (y)L k _ i (/) [l-w k ( Y k )F(y k ; 
[l-w k (Y' k )F(y' k ;x k )]n(y)n(y')dy'dy. 



Since ^(^1) is monotonically increasing in T k and assuming 
that F[y;x k ) is decreasing in y, which is the case for the logistic 
function, then [w k ( Y k ) - w k ( Y' k )] [F(y'; x k ) - F(y; x k )\ > 0. Hence, 
U k (t;Y k )>U k (t;Y' k ), which implies that Yi k x (d;Y k )< 
n^^a; Y' k ), that is x^ + l<x^ + l. A similar argument shows 
that the property holds for the TITE-CRM. 

2.4 Coherence of EWOC-PH 

In the design of Cancer phase I trials using cytotoxic agents, it is 
ethical not to escalate the current dose x k if patient k (currently 
treated at this dose level) exhibits DLT. Similarly, if patient k does 
not experience DLT by the end of the first cycle of therapy, then 



the dose recommended for patient k+l should not be lower than 
x k . This property is known as Coherence and was introduced by 
Cheung [26]. The CRM as proposed in [5] was shown to be 
coherent by Cheung [26] and the coherence of EWOC was 
established by Tighiouart and Rogatko [17]. For time to event 
toxicity Bayesian adaptive models, the definition of coherence has 
been extended in [26] who also showed that TITE-CRM is 
coherent. However, coherence in escalation does not have a 
practical interpretation in the case of delayed toxicities. Following 
the definition of coherence for time to event DLT in [26], one can 
easily show that EWOC-PH is also coherent. 

We note that here, properties (ii) and (iii) of Theorem 1 are 
different from the notion of coherence. Theorem 1 makes a 
statement about the dose to be given to patient k given the length 
of time patient k-l is under observation; the longer the time patient 
k- 1 is under observation with no evidence of DLT, the higher the 
dose to be allocated to patient k. A similar statement is made if the 
patient exhibits DLT. Unlike the notion of coherence, we are not 
comparing the doses of patient k- 1 and patient k. 

Simulation Studies 

3.1 Design Operating Characteristics 

In the simulation studies, we compare the operating character- 
istics of EWOC-PH with the original EWOC, TITE-EWOC and 
EWOC-W. The original EWOC introduced by Babb et al. [1 1] 
assumes that the DLT outcome is binary and dose allocation is 
carried whenever a patient is available for treatment. It is not 
necessary to wait for the DLT status of patients under observation 
to be resolved. EWOC-W, which stands for EWOC "waiting" 
works just like EWOC except that patients are enrolled to the trial 
only after the DLT status of all previously treated patients have 
been resolved. Both EWOC and EWOC-W use a logistic model 
(2.9). The designs are compared with respect to safety of the trial 
and efficiency of the estimate of the MTD by simulating m = 1 000 
trials of n = 48 patients each. Specifically, we calculated the 
average bias avebi as = m 1 • ._ 1 (% — ytme)> where y t is the 
estimate of the MTD for the z-th trial and y tme is the true MTD 
under a particular scenario, the mean square error MSE = m _1 - 
X^/-i^ z ' _ ^true) 2 ' me average proportion of patients exhibiting 
DLT \j{m-n)- = 1)> tne P ercent of trials with 

estimated MTD within 10% of the dose range of the true MTD, 
and the percent of trials with DLT rate exceeding 40%. These last 
two summary statistics approximate the probability that a given 
trial will result in an estimated MTD close to the true MTD and 
the probability that a trial will be safe, respectively. 

Dose levels have been standardized so that X min = 0 and 
^max - 1 • We took t = 1, the target probability of DLT was fixed at 
0 = 0.33, and the feasibility bound was set to a = 0.25. Indepen- 
dent uniform prior distributions were selected for p 0 and y, (p 0) y) 
~ Uniform([0, 0]x[O, 1]). We considered nine scenarios corres- 
ponding to three values for the true MTD y — 0.3, 0.5, 0.7, three 
values for the accrual rate v = 1, 2, and 4 patients per unit of time 
equal to the length of the observation window [0, t], and fixed 
value of po = 0.05. In order to make a fair comparison between the 
different models and assess the performance of EWOC-PH under 
model misspecification, we simulated the times to DLT using two 
different models as described in the next section. For each 
scenario, patients enter to the trial according to a time 
homogeneous Poisson process with rate v. 
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3.2 Models to generate time to DLT 

The first model we considered for generating the time to DLT is 
similar to the proportional hazards model (2.2) but with a Weibull 
baseline hazard function 

h(t) = ( K /l){t/X) K -\. (3.1) 

We took k — 0.5, 1, 1.5. Note that the case K — 1 corresponds to 
the exponential true model for EWOC -PH. Figure 1 shows the 
corresponding cdfs P(T<t \ X, K, dose = x) for various values of 
the true MTD 

7true5 ?true — 0-3, 0.5, 0.7 given three different doses 
x=0.2, ^ = y t rue5 * = 0.8. The solid line corresponds to the true 
model EWOC-PH and serves as a reference for departure of the 
other cdfs from the true model. Note that these curves have been 
chosen so that they have the same MTD value in each scenario. 
This is accomplished by setting P (T<t | X, K, dose — y) —9 
and P(T<t | A, K, dose = X min ) = p 0 . It then follows that X - 
[— fc/((ic+l) log(l-po))] 1/,C and jff = y" 1 logf-zc" 1 (k;+1) X K 
log(l-0)]. 

The second model we considered is a non-proportional hazards 
model 

h(t\x) = h 0 (t)e^ xI(t ^ )+ P2 xI(t>t i\ 0<h <t. (3.2) 

We used ho(t) = £ = 0.15, t\ = 0.5, and two different values for 
/? 2 , ^2 = 0.5, 2. The values for fix were selected to match the 
MTDs with the other models as above. It can be shown that Pi — 
y 1 log[— e y ^ 2 ~2b 1 log(l— 0)]. The corresponding cdfs are 
shown in Figure 2 along with the cdf of the true model which 
corresponds to the case ft\ — /?2- These models yield reasonable 
separations of the corresponding cdfs of the time to DLT from the 
true model and the extent of this separation increases with dose. 

For each of the above models, let 7^ be the time to DLT for 
patient i generated from that model under a particular scenario. If 
T->t, then the observed time to DLT is censored at t and the 
DLT response for EWOC and EWOC-W models is recorded as 
d t = 0. Otherwise, <5,- = 1 for EWOC and EWOC-W models. 



Results 

4.1 Trial Duration 

Table 1 shows the median trial duration across the m = 1 000 
simulated trials along with the first and third quartiles for the 
different designs as a function of the accrual rate. As expected, 
design of cancer phase I trials where the treatment is delayed until 
we observed the DLT status of all patients under observation can 
take at least twice as long when the expected number of available 
patients per cycle is 2, and can be more than four times longer 
when the accrual rate is 4. 

4.2 Trial Efficiency 

Figure 3 shows that EWOC-PH has smaller average bias 
relative to TITE-EWOC and the other two versions of EWOC 
under all scenarios when /c = 0.5, 1. When k- 1.5, EWOC-PH 
has a larger bias relative to TITE-EWOC but the extent of this 
difference is much smaller compared to the differences observed 
when k = 0.5. The MSE for all models shown in Figure 4 are fairly 
close in most scenarios. Similarly, Figure 5 shows that when 
/? 2 — 0.5, the absolute average bias for EWOC-PH is smaller than 
the average bias using the other 3 models except when the true 
MTD is high (y = 0.7). When ft> = 2.0, EWOC-PH has smaller 
absolute average bias relative to TITE-EWOC in 8 out of the 9 
scenarios. In the 9 th scenario corresponding to the true MTD 
y = 0.5 and accrual rate v=l, EWOC-PH and TITE-EWOC 
have about the same bias. Again, the MSE for all models shown in 
Figure 6 are very close in most scenarios. Based on these results, 
EWOC-PH does better than TITE-EWOC in the majority of 
these scenarios and model misspecifications combined. These 
cases occur when the cdf of the model from which the time to 
DLT is generated is above the cdf of the true model when the dose 
x is below the true MTD, see Figures 1 and 2. 

Figure 7 shows that EWOC-PH does better than the other 
models in terms of percent of trials with estimated MTD within 0. 1 
of the true MTD in the majority of the scenarios when k — 0.5, 1. 
The extent of this difference can be as high as 12% between 
EWOC-PH and EWOC. This occurs when the true MTD is low 
(y = 0.3) and the accrual rate is v = 1 . When k — 1.5, the results are 
mixed and depend on the value of the true MTD and accrual rate. 




Figure 1. Cumulative distribution function plots for different values of a- and the true value of the MTD / for various dose levels. 

doi:10.1371/journal.pone.0093070.g001 
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Figure 2. Cumulative distribution function plots for the non-proportional hazards model for different values of the true MTD / and 
for various dose levels. 

doi:10.1371/journal.pone.0093070.g002 



However, in eight out of the nine scenarios, the percent of trials 
with estimated MTD within 0.1 of the true MTD for EWOC-PH 
and TITE-EWOC are very close. When the time to DLT is 
generated by a non-proportional hazards model, Figure 8 shows 
that when /? 2 = 0.5, the percent of trials with estimated MTD 
within 0.1 of the true MTD using EWOC-PH is higher than the 
corresponding percentages using the other 3 models in six out of 
the nine scenarios. TITE-EWOC does better when the MTD is 
high, y = 0.7. Similarly, when /? 2 = 2.0, the percent of trials with 
estimated MTD within 0.1 of the true MTD using EWOC-PH is 
higher than the corresponding percentages using the other 3 
models in seven out of the nine scenarios. These percentages are 
very close in the case where the true MTD is y = 0.5. Based on 
these summary statistics, we conclude that EWOC-PH is a good 
alternative design to TITE-EWOC since it may result in a smaller 
bias under the majority of scenarios considered here and some 
model misspecification. 

4.3 Trial Safety 

Figures 9 and 1 0 show that the average proportion of patients 
exhibiting DLT does not exceed 6 = 0.33 using all four models for 
all scenarios and under the two model misspecifications. Further- 
more, Figures 1 1 and 1 2 show that the estimated probability that a 
prospective trial will result in an excessive number of DLTs, 
defined as a DLT rate exceeding 40%, is very small and does not 
exceed 0.04 under all 9 scenarios and all the different models for 
generating the time to DLT considered here. We conclude that in 
general, designing a trial using EWOC-PH is safe but additional ad 
hoc stopping rules for excessive toxicity should always be put in 
place in writing the protocol. 

4.4 Model Robustness 

For a given scenario, Figures 3 and 5 show that the largest 
difference in the average bias using EWOC-PH is around 0.12, or 
12% of the dose range. This occurs when the true MTD y = 0.7, 
accrual rate v=l, and K=l.5 (Figure 3) and the true MTD 
y = 0.7, accrual rate v = 1, and fi 2 = 2.0 (Figure 5). When the true 
MTD is y = 0.3, the largest difference in the average bias using 
EWOC-PH under the Weibull hazards corresponding to K — 0.5, 



1.0, 1.5 and the non-proportional hazards models corresponding 
to /? 2 = 0.5, 2.0 is about 0.03. When the true MTD is y = 0.5, the 
largest difference in these average biases is around 0.065. 
Similarly, when comparing the percent of trials with estimated 
MTD within 0.1 of the true MTD for a given scenario, Figures 7 
and 8 show that the largest differences between these percentages 
across the different models for generating the times to DLT is 
around 10%. We conclude that EWOC-PH is reasonably robust 
in terms of precision of the estimate of the MTD under the model 
misspecifications considered here. 

Figures 9 and 10 also show that EWOC-PH is robust under the 
model misspecifications considered here where the largest 
difference in the average probability of DLT is about 0.05. This 
occurs when the true MTD is y = 0.3 and the times to DLT are 
generated from a proportional hazards model with Weibull 
baseline hazards corresponding to /c = 0.5 and K— 1.5. Similarly, 
the largest difference in the percent of trials with DLT rate 
exceeding 40% is 0.036. Again, this occurs under the same 
scenario and models misspecification discussed above, see 
Figure 1 1 . 

4.5 Real Trial Example 

We designed a cancer phase I clinical trial in order to estimate 
the MTD of Veliparib (ABT-888) in combination with fixed doses 



Table 1. Median duration of the four trial designs by accrual 
rate. 





Design 


Accrual Rate 








1 


2 


4 


EWOC 


48.2(44, 52.9) 


25.2(22.8, 27.5) 


13.8(12.8, 14.9) 


EWOC-W 


86.2(81.6, 90.7) 


64.5(61.9, 67.6) 


61.6(59.8, 63) 


EWOC-PH 


48.3(44.4, 53) 


25.2(22.9, 27.5) 


13.7(12.7, 15) 


TITE-EWOC 


48.4(44.2, 52.9) 


25.3(23.2, 27.7) 


13.7(12.7, 14.8) 



The first and third quartiles are shown in parentheses. 
doi:10.1371/journal.pone.0093070.t001 
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Figure 3. Average bias of the estimated MTD for each of the four models and nine scenarios when the time to DLT is generated 
from a proportional hazards model with Weibull baseline hazard rate with parameters /• = 0.5 (left panel), >. = /, = 1.0 (middle panel), 
and /i = 1.5 (right panel). 
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of Gemcitabine and Intensity Modulated Radiation Therapy in 
patients with locally advanced, un-resectable pancreatic cancer. 
The trial has just been approved by the FDA and the first patient 
was enrolled at the time of writing of the manuscript. The MTD is 
defined to be the dose level of Veliparib that when administered to 
a patient results in a probability equal to 6 = OA that a dose 
limiting toxicity will be manifest within ten weeks, T — 10. Due to 
the long length of the observation window to resolve DLT status, 
EWOC-PH was used to design the trial. 

The dose for the first patient in the trial was set at 20 mg, 
previous results indicating this to be a safe dose. The dose for each 
subsequent patient will be determined so that, on the basis of all 
available data, the probability that it exceeds the MTD is equal to 
a pre-specified feasibility bound a = 0.25. The prior distribution of 



the MTD is based on the correlated priors model M4 described in 
Tighiouart et al. [13] where the support of the MTD is (0, °°). 
After extensive consultation with the principal investigator (PI) of 
the trial, we will assume that the a priori probability that the MTD 
exceeds 100 mg is 10%. Upon completion of the trial, the MTD 
will be estimated as the median of the marginal posterior 
distribution of the MTD. 

Figure 1 3 shows an example of a simulated trial when the true 
value of the MTD y = 70 mg and the probability of DLT at the 
initial dose po is 0.05. Patients enter the trial according to a time 
homogeneous Poisson process with an average number of 3 
patients per 10 weeks. This figure shows patients' number, the 
time when they enter the trial, the DLT status and how long it 
took to exhibit DLT if they did. To assess design operating 
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Figure 4. Mean square error of the estimated MTD for each of the four models and nine scenarios when the time to DLT is 
generated from a proportional hazards model with Weibull baseline hazard rate with parameters /r=0.5 (left panel), >. = /, = 1.0 
(middle panel), and /c- 1.5 (right panel). 
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Figure 5. Average bias of the estimated MTD for each of the four models and nine scenarios when the time to DLT is generated 
from a non-proportional hazards model. /? 2 = 0.5 (left panel), /? 2 = 2.0 (right panel). 
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characteristics, we simulated 1000 trials under 3 scenarios for the 
true value of the MTD y. In each case, the probability of DLT at 
the initial dose is 0.05, the arrival times follow a time 



homogeneous Poisson process with rate equal to 3 patients per 
cycle, and n = 30 patients per trial. Table 2 shows the summary 
statistics based on 1000 trials. We can see that the estimated MTD 



s 



o 
o 



t 



True MTD = 0.3 



ft 2 $ 



True MTD = 0.5 



~i r 

2 4 



■ EWOC-PH 
♦ EWOC-W 
a EWOC 
o TITE-EWOC 



A 
♦ 

o 



True MTD = 0.7 



3 - 



o 
o 



A 

A O 

O £ t 



True MTD = 0.3 



True MTD = 0.5 



■ EWOC-PH 
♦ EWOC-W 
a EWOC 
o TITE-EWOC 



t a * 



True MTD = 0.7 



Accrual Rate 



Accrual Rate 



Figure 6. Mean square error of the estimated MTD for each of the four models and nine scenarios when the time to DLT is 
generated from a non-proportional hazards model. /? 2 = 0.5 (left panel), /? 2 = 2.0 (right panel). 
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Figure 7. Percent of trials with estimated MTD within 0.1 of / true for each of the four models and nine scenarios when the time to 
DLT is generated from a proportional hazards model with Weibull baseline hazard rate with parameters a- =0.5 (left panel), /c- 1.0 
(middle panel), and /c- 1.5 (right panel). 
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is close to the true underlying y and the overdose protection 
property of EWOC is illustrated by the last row of Table 2. 

Discussion and Conclusion 

We proposed a Bayesian adaptive design for dose finding studies 
with the distinctive feature that it takes into account the time for a 
patient to exhibit DLT. The method is an extension of EWOC 



where the dose allocated to a patient is based on the doses 
allocated to previously and currently treated patients and the time 
it takes to exhibit DLT. The design is dynamic in the sense that 
patients can enter the trial at any time and the dose allocated to a 
patient makes use of all the information available at the time the 
patient enters the trial. We used a proportional hazards model 
with exponential baseline hazard function ho(t;ju) to describe the 
dose-toxicity relationship for simplicity. The assumption of 
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Figure 8. Percent of trials with estimated MTD within 0.1 of /true for each of the four models and nine scenarios when the time to 
DLT is generated from a non-proportional hazards model. /? 2 = 0.5 (left panel), /? 2 = 2.0 (right panel). 
doi:10.1371/journal.pone.0093070.g008 
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Figure 9. Average proportion of DLTs for each of the four models and nine scenarios when the time to DLT is generated from a 
proportional hazards model with Weibull baseline hazard rate with parameters k- 0.5 (left panel), k- 1 .0 (middle panel), and k- 1 .5 
(right panel). 

doi:10.1371/journal.pone.0093070.g009 



constant baseline hazard function is reasonable since the 
parameter of interest y depends on h 0 (t;ju) only through the 
cumulative baseline hazard H Q (t]jJ) in the observation window [0, 
t] as shown in equation (2.5). 

Simulation studies showed that EWOC-PH has smaller average 
bias when estimating the MTD compared to EWOC, TITE- 
EWOC, and EWOC-W in the majority of scenarios considered 
and under two different model misspecifications. Moreover, the 



estimated probability that a prospective trial will result in an 
estimate of the MTD within 10% of the dose range of the true 
MTD is higher when using EWOC-PH relative to EWOC, TITE- 
EWOC, and EWOC-W in the majority of scenarios and under 
model misspecification. We have also shown that EWOC-PH 
results in a safe trial design under all scenarios and model 
misspecification considered here. Furthermore, EWOC-PH is 
practically robust with respect to trial safety and efficiency under 
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Figure 10. Average proportion of DLTs for each of the four models and nine scenarios when the time to DLT is generated from a 
non-proportional hazards model. /? 2 = 0.5 (left panel), /? 2 = 2.0 (right panel). 
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Figure 11. Percent of trials with DLT rate exceeding 40% for each of the four models and nine scenarios when the time to DLT is 
generated from a proportional hazards model with Weibull baseline hazard rate with parameters /c- 0.5 (left panel), k- 1 .0 (middle 
panel), and a~=1.5 (right panel). 
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reasonable model misspecification. Therefore, we conclude that 
EWOC-PH is a good alternative design for late-onset toxicity 
cancer phase I trials relative to TITE-EWOC. 

We applied this methodology to design a phase I cancer trial in 
which the length of the cycle is 10 weeks due to the use of 
radiation. In this trial, the PI could not provide an upper bound 
for the support of the MTD. However, based on previous clinical 
trials using ABT-888 in combination with Gemcitabine, the PI 



believes that the probability that the MTD exceeds 100 mg is 
about 10% a priori. Therefore, we used a class of correlated priors 
for y and p 0 described in [13]. To our knowledge, this is the first 
cancer phase I trial which uses time to DLT and an unbounded 
support for the MTD a priori. 

An important characteristic of EWOC-PH is the effect of the 
length of time a patient is under observation on the dose 
recommended to the next patient. We have shown that the longer 
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Figure 12. Percent of trials with DLT rate exceeding 40% for each of the four models and nine scenarios when the time to DLT is 
generated from a non-proportional hazards model. ^ = 0.5 (left panel), $2 = 2.0 (right panel). 
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Figure 1 3. A simulated trial with true MTD = 70 mg. The red pins 
indicate DLT and the size of the pins correspond to the time to DLT 
since patients are given the treatment. The green pins indicate that a 
patient did not experience DLT within one cycle of treatment. 
doi:1 0.1 371/journal.pone.0093070.g01 3 

is a patient under observation with no evidence of DLT, the higher 
is the recommended dose for the next patient, assuming the DLT 
status of the previously treated patients has been resolved. This 
intuitively attractive property holds for TITE-EWOG and TITE- 
CRM as shown by Theorem 1. We have also shown that if a 
patient exhibits DLT shortly after treatment starts, then the 
recommended dose for the next patient is much lower than the 
recommended dose had the previous patient experienced DLT 
later on in the cycle. This property is not shared by either TITE- 
EWOC or TITE-CRM since patients given the same dose and 
who exhibit DLT at different time points contribute the same 
information to the likelihood function. 

The design we presented is Bayesian and the dose allocated to 
the next patient corresponds to the estimate of the MTD y having 
minimal risk with respect to the asymmetric loss function l a (xj>) = 
a (y—x) if x<y, and l a {x 2 y) = (1 — a) (x—y) otherwise. As such, it 
belongs to the "type I Best Intension design" defined by Fedorov 
et al. [27]. While the original EWOC [12] design is consistent in 
probability under a one parameter logistic model, we do not have 
a proof that this new design is consistent due to its added 
complexity. We did some simulations (data not shown) and we 
observed that the average bias of the estimate of the MTD 
decreases as a function of the sample size in the trial under the true 
model. 

The methodology presented here assumes continuous dose 
levels of the agent under study, which is not uncommon in 
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Table 2. Design operating characteristics based on 1000 trial 
replicates. 



True MTD 



40 70 100 



Estimated MTD 47 74 99.5 

Percent of DLT 29 33.9 22.1 



doi:10.1371/journal.pone.0093070.t002 

practice. When a prospective trial uses a pre-specified number of 
dose levels, we can apply this method by rounding down the dose 
recommended by EWOC-PH to the nearest available dose in the 
trial as proposed in [11,17]. The corresponding design will likely 
suffer the drawbacks of large variability of the distribution of the 
number of cohorts treated at the MTD relative to the "Up-and- 
Down" designs, see Oron and Hoff [28]. However, it is important 
to note that the use of "Up-and-Down" designs is impractical in 
this setting due to the fact that the length of a cycle of therapy is 
much longer than the traditional 3 or 4 weeks follow up and the 
status of DLT must be resolved before enrolling the next cohort of 
patients. For instance, Table 1 shows that with an accrual rate of 2 
patients per month, the median duration of a trial using EWOG- 
W (or an "Up-and-Down") design is 64.5 months relative to 25.2 
months for EWOC-PH, which is practically not feasible. 

We plan to make available a software application for trial design 
and operating characteristics evaluation of cancer phase I trials 
using EWOC-PH. This would be an addition to our established 
Web based application of EWOC [29] . We are also working on 
adapting EWOC-PH to account for patients' baseline character- 
istics as in [15] and extending the binary DLT outcome to an 
ordinal toxicity grade, see [16] for an extension of EWOC to 
account for an intermediate grade of toxicity. Since our model 
makes use of the proportional hazards assumption, design 
operating characteristics under model miss-specification should 
always be studied under practical scenarios when designing 
prospective trials using this design. At the end of the trial, if there 
is evidence that the proportional hazards assumption is violated, 
standard techniques including the use of time varying covariate 
[30] will be carried out to analyze the data. Finally, extending 
EWOC-PH for determining the MTD curve of drug combination 
of two agents is under investigation. 
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